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We report the genome sequence of a healthcare-associated MRSA type ST239 clone isolated 
from a patient with septicemia in Malaysia. This clone typifies the characteristics of ST239 
lineage, including resistance to multiple antibiotics and antiseptics. 



Introduction 

Antibiotic resistance in S. aureus is a major con- 
cern, as an increasing number of infections are 
caused by methicillin-resistant S. aureus (MRSA). 
Figure 1 shows the phylogenetic position of S. 
aureus in relation to other staphylococci. In Ma- 
laysia, the incidence of MRSA-related infections is 
a cause of concern in hospitals country-wide. 
Health-associated MRSA (HA-MRSA) has been 
dominated by a few lineages in Southeast Asia, 
particularly ST239. Sequence type 239 is an inter- 
national healthcare-associated (HA) MRSA lineage 
prevalent in Asia, South America and Eastern Eu- 
rope, which includes EMRSA-1, -4, -7, and -11 and 
the Brazilian, Portuguese, Hungarian, and 
Viennese clones. Strains of type ST239 are typical- 
ly resistant to multiple classes of antibiotics and 
antiseptics such as p-lactam antibiotics. 

Classification and features 

We have chosen a representative of an MRSA 
strain, termed MRSA PR01 isolated from a patient 
with septicemia, isolated from a hospital in Kuala 
Lumpur. Table 1 indicates general information 
gathered on MRSA PR01. The MRSA PR01 strain 
has been identified as sequence type 239 (ST239) 
by multilocus sequence typing (MLST). Initial disc 
susceptibility tests showed that the strain is re- 
sistant to (B-lactam antibiotics oxacillin, ampicillin, 
cefuroxime, ceftriaxone, gentamicin, erythromy- 




cin, ciprofloxacin and co-trimoxazole. 

Genome sequencing information 

Genome project history 

This organism was selected for sequencing as a 
representative of MRSA infection in a local Malay- 
sian hospital. The genome sequences of this or- 
ganism were deposited in GenBank (WGS data- 
base). Sequencing, finishing and annotation were 
performed at the Pharmacogenomics Centre 
(PROMISE), UiTM. Table 2 presents the project in- 
formation and its association with MIGS version 
2.0 compliance [14]. 

Growth conditions and DNA isolation 

MRSA PR01 was grown overnight under aerobic 
conditions in Tryptic Soy Broth at 37°C. DNA ex- 
traction was performed using MasterPure™ Gram 
Positive DNA Purification Kit (Epicentre, Madison, 
USA) as per manufacturer's instructions. The con- 
centration and purity of resultant DNA was as- 
sessed by UV spectrophotometry (Nanodrop, 
Thermo Scientific). 5 \ig of genomic DNA (A260/280 
= 1.88) was used for library preparation. 

Genome sequencing and assembly 

The genome sequence was obtained using 104 Mb 
of paired-end (300 bp spacing) data from the 
Illumina GAII X platform (Illumina, San Diego, CA) 
with 36-bp reads. Sequence data were assembled 
using CLCBio Genomics Workbench (CLC bio, Aar- 
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hus, Denmark). One hundred and ninety-five 
contigs (N50: 13,272 bp) were generated, and 
were overlaid with the reference sequence Mu50 
using OSLay. Fourteen supercontigs were gener- 



ated as a result. Gaps were closed using Sanger 
sequencing. 
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Figure 1. Phylogenetic tree highlighting the position of Staphylococcus aureus strain PRO! relative to other 
type strains within the Staphylococcaceae. The strains and their corresponding GenBank accession numbers 
for 16S rRNA genes are: 5. aureus strain ATCC 12600, L36472; 5. saprophyticus strain ATCC 15305, 
AP008934; 5. epidermidis strain ATCC 14990, D83363; 5. hominis strain DSM 20328, X66101; 5. 
haemolyticus strain CCM2737, X66100; and 5. cohnii strain ATCC 49330, AB009936. The tree uses se- 
quences aligned by the RDP aligner, and uses the Jukes-Cantor corrected distance model to construct a dis- 
tance matrix based on alignment model positions without the use of alignment inserts, and uses a minimum 
comparable position of 200. The tree is built with RDP Tree Builder, which uses Weighbor [1] with an al- 
phabet size of 4 and length size of 1000. The building of the tree also involves a bootstrapping process re- 
peated 100 times to generate a majority consensus tree [2]. Staphylococcus lutrae (X84731) was used as an 
outgroup. 



Genome properties 

The MRSA PR01 genome consists of a 2,725,110- 
bp circular chromosome with a GC content of 
32.6% (Table 3). The MRSA PR01 genome con- 
tains 2668 CDs with 19 rRNA features ()■ A total of 
1722 (64.5%) of protein coding genes were as- 
signed to COGs, and a breakdown of the functional 
assignment of COG-assigned genes is shown in 
Table 4. Plasmid sequences were only partially 
sequenced. Figure 2 depicts genomic regions of in- 
terest found in the preliminary analysis of the 
MRSA PRO 1 genome. 

Initial analysis of the genome revealed several key 
features. This genome has a typical SCCmec type 
III cassette, containing cadmium resistance genes. 
SCCmec type III is a composite element that is 
comprised of SCCmec and SCCmercury. In the 
MRSA PR01 genome, like others, this region har- 
bors ccrC, pI258 and Tn554 as well as the genes 



involved in cadmium resistance. The MRSA PR01 
genome contains two pathogenicity islands, and 
several resistance features were identified such as 
the qacA gene, which confers resistance to anti- 
septics such as cationic biocides, quaternary am- 
monium salts, and diamidines via an export- 
mediated mechanism, and the norA gene which 
confers resistance to hydrophilic quinolones such 
as norfloxacin and ciprofloxacin. There were 9 re- 
gions defined as prophage regions by PHAST [17] 
with one complete prophage region.genes were 
identified in the genome. A total of 2,267 genes 
(72.66%) were assigned a putative function. The 
remaining genes were annotated as hypothetical 
proteins. The properties and the statistics of the 
genome are summarized in Table 3. The distribu- 
tion of genes into COGs and KEGG functional cate- 
gories is presented in Table 4. 
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Conclusion 

This study is the first to report on the whole ge- 
nome sequence of a Malaysian MRSA isolate. Pre- 
liminary analysis of the genome has highlighted 
the genetic determinants that are responsible for 
the organism to adapt easily to selective pres- 
sures. Further research is being conducted to pro- 
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vide insight on the adaptive power of this 
healthcare-associated strain to attain high re- 
sistance to antibiotics. 

Nucleotide sequence accession numbers. This 
Whole Genome Shotgun project has been deposit- 
ed at DDBJ/EMBL/GenBank under the accession 
ANPO00000000. The version described in this pa- 



per is the first version, ANPO01000000. 

Table 1 . Classification and general features of Staphylococcus aureus MRSA PRO! 



MIGS ID 


Property 


Term 


Evidence 
code 3 




Current classification 


Domain Bacteria 

Phylum Firmicutes 

Class Bacilli 

Order Bacillales 

Family Staphylococcaceae 

Genus Staphylococcus 

Species Staphylococcus aureus 

Type strain MRSA PRO! 


[3] 
[4-7] 
[8 9] 
[6,10] 
[9,11] 
[6,12] 
[6,12] 
TAS 




Gram stain 


Positive 


TAS 




Cell shape 


Coccus 


TAS 




Motility 


Non-motile 


TAS 




Sporulation 


Non-sporulating 


TAS 




Temperature range 


Mesophile 


TAS 




Optimum temperature 


30-37°C 


TAS 




Carbon source 


Glucose 


TAS 




Energy source 
Terminal electron re- 
ceptor 


Chemoorganotrophic 




MIGS-6 

MIGS-6.3 

MIGS-22 


Habitat 
Salinity 

Oxygen 


Human respiratory tract, skin 
Facultative anaerobe 


TAS 
TAS 


MIGS-15 


Biotic relationship 






MIGS-14 


Pathogenicity 


Opportunistic pathogen 


TAS 


MIGS-4 


Geographic location 


Malaysia 




MIGS-5 


Sample collection time 


May 2009 




MIGS-4.1 


Latitude 


4.1936°N 




MIGS-4.2 


Longitude 


103.7249°E 




MIGS-4.3 


Depth 


Not reported 




MIGS-4.4 


Altitude 


Not reported 





a Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature). These evi- 
dence codes are from the Gene Ontology project [19]. 
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Ta ble 2. Project information 



\ a i /" c in 
MICo ID 


Property 


Term 


MIGS-31 


Finishing quality 


Non-contiguous Finished 


1 VI 1 VJO A. O 


I inr^cioc r icon 


lino ^flhn Mil m i n 3 \. A 1 1 v nonnm \c inr^n/ 
\_Jl Itr _>JUUU IllUIIIIikl vj/\IIA tltrl lUI 1 1 1L. llJJidiy 


MIGS-29 


Sequencing platforms 


ill ' f — ■ aii 

lllumina GAIIx, Sanger 


MIGS-31 .2 


Fold coverage 


>200x 


MIGS-30 


A 1 I 

Assemblers 


CLCBio Genomics Workbench 


K A 1 t — f '1 f\ 

MIGS-32 


Gene calling method 


Glimmer and GeneMark 




Genbank ID 


ANPO01 000000 




Genbank Date of Release 


January 11, 2014 




GOLD ID 


Gi0037576 


MIGS-13 


Project relevance 


Medical, Tree of life 



23000000 

I I 




1400000 



Figure 2. Visual representation of the MRSA PR01 genome. From outer to inner tracks: Scale (in ba- 
ses); annotated CDSs colored according to predicted function (red, SCC element; blue, genomic is- 
land; green, transposon/ integrative conjugative element; purple, 5. aureus pathogenicity island 
[SaPI], brown, prophage); forward strand CDS; reverse strand CDS; GC skew. 
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Table 3. Nucleotide content and gene count 


levels of the MRSA PRO! 


genome 




Value 


/o Ul lUUll 


Genome size (bp) 


0 70 m m 

z,/z j, 1 1 u 


I UU 


DNA G+C content (bo) 


888,386 


32.6 


DNA Coding region (bp) 


2,555,544 


90.03 


Total genes 


2687 


100 


RNA genes 


19 


0.7 


Protein-coding genes 


2668 


99.3 


Genes with protein function prediction 


2,267 


72.66 


Genes assigned to COGs 


1722 


64.5 



a The total is based on either the size of the genome in base pairs or the total number 
of protein coding genes in the annotated genome 



Table 4. Number of 


genes associated with the 25 general COG functional categories 


Code 


Value 


%age a 


Description 


J 


140 


5.247 


Translation 


A 


_ 




RNA processing and modification 


K 


127 


4.760 


Transcription 


L 


126 


4.723 


Replication, recombination and repair 


B 






Chromatin structure and dynamics 


D 


23 


0.862 


Cell cycle control, mitosis and meiosis 


V 
i 






Mi ir D3r cIti tcti iro 
INULIcdl 3U ULLUIC 


V 

V 






nofoncD mof^M^nicmc 
L/tMcl Idc 1 1 IcCI Idl 1 lal 1 la 


T 


47 


1.762 


Signal transduction mechanisms 


M 


91 


3.41 1 


Cell wall/membrane biogenesis 


N 


4 


0.150 


Cell motility 


7 
Z. 


u 


u 


L,yrosKeieion 


w 


0 


0 


Extracellular structures 


u 


0 


0 


Intracellular trafficking and secretion 
Posttranslational modification, protein turnover, 


o 


72 


2.699 


chaperones 


c 


106 


3.973 


Energy production and conversion 


G 


129 


4.835 


Carbohydrate transport and metabolism 


E 


186 


6.972 


Amino acid transport and metabolism 


F 


68 


2.549 


Nucleotide transport and metabolism 


H 


83 


3.111 


Coenzyme transport and metabolism 


I 


62 


2.324 


Lipid transport and metabolism 


P 


123 


4.610 


Inorganic ion transport and metabolism 

Secondary metabolites biosynthesis, transport and ca- 


Q 


23 


0.862 


tabolism 


R 


193 


7.234 


General function prediction only 


S 


119 


4.460 
35.45 


Function unknown 




946 


7 


Not in COGs 


a The total 


is based on 


the total 


number of protein coding genes in the annotated genome. 
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Conclusion 

Description of Sulfurimonas hongkongensis 
sp. nov. 

Sulfurimonas hongkongensis (hong.kong.en'sis. 
N.L. fern. adj. hongkongensis pertaining to Hong 
Kong, the city where the type strain was isolated). 

Strain AST-10 T is rod-shaped with size of 0.2-0.4 
|im x 0.5-1.2 |im. It is an obligate anaerobe and oc- 
curs singly. The temperature range for growth is 
15-35°C, optimum at 30°C. The pH range for 
growth is 6.5-8.5, optimum at 7.0-7.5. The salinity 
range for growth is 10-60 g L 1 , and optimum at 30 
g L 1 . Strictly chemolithoautotrophic growth oc- 
curs with H2, HS- or S2O3 2 " as an electron donor 
and with nitrate as an electron acceptor. Nitrate is 
reduced to N2, and reduced sulfur compounds are 
oxidized into S° or SO4 2 " (depending on molar ratio 
of S203 2 /N03"). The major cellular fatty acids are 

Cl4:0, Cl6:0, 2"OH Cl6:0, Cl6:l, Cl8:0, and Cl8:l, With Cl6:0 

2-oh as a unique fatty acid different from other spe- 
cies in the genus Sulfurimonas. 

The type strain AST-10 T = DSM 2096 T = JCM 
18418 T , was isolated from coastal sediment at the 
Kai Tak Approach Channel connected to Victoria 
Harbour in Hong Kong, China. The GC content of 
the genome is 34.9%. The genome sequence has 
been deposited at DDBJ/EMBL/GenBank under 
accession number AUPZ00000000. 
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